Ostrowski Numeration and the Local Period of Sturmian 

Words 



Luke Schaeffer 

"<n": 

School of Computer Science 
(N ■ University of Waterloo 

O ; Waterloo, ON N2L 3G1 Canada 

13schaef@cs. uwaterloo . ca 

00 

October 9, 2012 

o 

Abstract 

> 

m . 

t^j- . We show that the local period at position n in a characteristic Sturmian word can be given 

; in terms of the Ostrowski representation for n + 1. 

d 

1 Introduction 

> : 

^ ■ We consider characteristic Sturmian words, which are infinite words over {0, 1} such that the ith 

character is 

[a(i + 1)J — [ai\ — [a\ 

for some irrational a. We give an alternate definition later better suited to our purposes. Let f w (n) 
denote the number of factors of length n in w, also known as the subword complexity of 0(n). It is 
well-known that f w (n) = n+1 when w is a Sturmian word. On the other hand, the Coven-Hedlund 
theorem [4] states that f w (n) is either bounded or f w (n) > n + 1 for all n. In this sense, Sturmian 
words are extremal with respect to subword complexity. 

In a recent paper [3], Restivo and Mignosi show that characteristic Sturmian words are also ex- 
tremal with respect to local period, which we define shortly as part of Definition [2l Let p w (n) 
denote the local period of a word w at position n. The critical factorization theorem states that 
either p w (n) is bounded or p w (n) > n + 1 for infinitely many n. Restivo and Mignosi show that 
when w is a characteristic Sturmian word, p w (n) is at most n + 1 and p w (n) —n + 1 infinitely 
often. Hence, characteristic Sturmian words also have extremal local periods. 
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Table 1 : The local period function for the Fibonacci word. 

Unlike subword complexity, the local period function p w (n) is erratic. Consider Table [H which 
gives the local period at points in F, the Fibonacci word. Although there are patterns in the table 
(for example, each Pi?(n) is a Fibonacci number), it is not obvious how is related to n in 

general. Shallit [1J showed that pp(n) is easily computed from the Zeckendorf representation 
of n + 1, and conjectured that for a general characteristic Sturmian word w, p w (n) is a simple 
function of the corresponding Ostrowski representation for In this paper, we confirm Shallit's 
conjecture by describing p w (n) in terms of the Ostrowski representation for n + 1. 



2 Notation 

Let E := {0, 1} for the rest of this paper. We write w[n] to denote the nth letter of a word w (finite 
or infinite), and w[i..j] for the factor tu[i]tu[i + 1] • • ■ w[j — l]w[j]. We use the convention that the 
first character in w is w[0). Let \w\ denote the length of a finite word w. 

2.1 Repetition words 

Definition 1. Let w be an infinite word over a finite alphabet E. A repetition word in w at position 
i is a non-empty factor w[i..j] such that either w[i..j] is a prefix of w[0..i — 1] or w[0..i — 1] is a 
prefix of w[i..j). 

If the infinite word w is recurrent (i.e., every factor in w occurs more than once in w) then every 
factor occurs infinitely many times. In particular, for every % the prefix w[0..i — 1] occurs in w[i. .oo], 
so there exists a repetition word at every position in a recurrent word. 

Definition 2. Let w be an infinite recurrent word over a finite alphabet E. Let r w {i) denote the 
shortest repetition word in w at position i. The length of the shortest repetition word, denoted by 
Pw(i) '■= \r w (i) |, is called the local period in w at position i. 

We note that Sturmian words are recurrent, so p w (i) and r w (i) exist at every position for a charac- 
teristic Sturmian word w. We omit further discussion of the existence of p w (i) and r w (i). 

For example, consider the Fibonacci word F shown in Figure [J The factors F[5..6] = 01, 
F[5..9] = 01001 and F[5..17] = 0100100101001 are examples of repetition words in the Fi- 
bonacci word at position 5. The shortest repetition word at position 5 is r>(5) = F[5..6] = 01 and 
therefore the local period at position 5 is Pf{5) = 2. 
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Figure 1 : The Fibonacci word F and some repetition words at position 5 

3 Characteristic Sturmian Words and the Ostrowski Repre- 
sentation 

We define characteristic Sturmian words and the Ostrowski representation based on directive se- 
quences of integers, defined below. For every directive sequence there is a corresponding charac- 
teristic Sturmian word. Similarly, for each directive sequence there is an Ostrowski representation 
associating nonnegative integers with strings. 

Definition 3. A directive sequence a = {ai}°Z is a sequence of nonnegative integers, where 
Oj > for all i > 0. 

Directive sequences are in some sense infinite words over the natural numbers, so we use the same 
indexing/factor notation. The notation a[i) indicates the 2th term, eij. We will frequently separate a 
directive sequence a into the first term, a[0], and the rest of the sequence, a[l..oo]. 

Note that our definitions for characteristic Sturmian words and Ostrowski representations deviate 
slightly from the definitions given in our references, [0 and 0. Specifically, there are two main 
differences between our definition and fl2): 

1 . We start indexing the directive sequence at zero instead of one. 

2. The first term is interpreted differently. For example, if the first term in the sequence a then 
our characteristic Sturmian word begins with a l, whereas the characteristic Sturmian word 
in Gl begins withO^ 1 !. 

In other words, we are describing the same mathematical objects, but label them with slightly 
different directive sequences. Any result that does not explicitly reference the terms of the directive 
sequence will be true for either set of definitions. This includes our main result, Theorem PT3l 

3.1 Characteristic Sturmian Words 

Consider the following collection of morphisms. 
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Definition 4. For each k > 0, we define a moronism tp k : E* — > S* such that 

^fe(O) = k l 

VkO) = 

for all k > 0. 

Given a directive sequence, we use this collection of morphisms to construct a sequence of words. 

Definition 5. Let a be a directive sequence. We define a sequence of finite words {Xi}^l over £ 
where 

X n = (ip a [ ) o • • • o y? Q[n _i])(0). 
We call {Xi}°Z the standard sequence, and we say Xi is the characteristic block. 

Sometimes the characteristic blocks are defined recursively as follows. 

Proposition 6. Let a be a directive sequence and let {Xi}°l be the corresponding directive se- 
quence. Then 

{0, ifn = 0; 

0«Ml, ifn = l; 

X^ 1] X n _ 2 , ifn>2. 

Proof. See Theorem 9.1.8 in Q, Note that due to a difference in definitions, the authors number 
the directive sequence starting from one instead of zero, and they treat the first term differently 
(i.e., they define X 1 as O" 1 " 1 ! instead of a[0] l). □ 

It follows from the proposition that X n ^i is a prefix of X n for each n > 2, and therefore the limit 
lim^oo X n exists. We define c a , the characteristic Sturmian word corresponding to the directive 
sequence a, to be this limit. 

c a := lim X n . 

Then X n is a prefix of c a for each n > 2. 

There is a simple relationship between c Q , a[0] and c a [i.. 00 i, given in the following proposition. 
Proposition 7. Let a be a directive sequence, and let (3 := a[l..oo]. Then 

C a = (p a [0] (cp) 

Proof. (Sketch) We factor <p a ^ out of each Xi and then out of the limit. 

c a = lim (W 0] o • • • o (p a [ n -i])(Q) = (p a [ ] ( lim (ip a[1] o • • • o ^ a[n _i])(0)) = v? a[0 ] (c^) . 
Alternatively, see Theorem 9.1.8 in [2] for a similar result. □ 
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Notice that if a[0] =0 then c a and Cg are the same infinite word up to permutation of the alphabet, 
since p swaps and 1 . Permuting the alphabet does not affect the local period or repetition words, 
so henceforth we assume that the first term of any directive sequence is positive (and therefore all 
terms are positive). Consequently, all characteristic Sturmian words we consider will start with 
and avoid the factor 1 1 . 

Let us give an example of a characteristic Sturmian word. Consider the directive sequence a 
beginning 1, 3, 2, 2. Then we can compute the first five terms of the standard sequence 

^o = 

X 1 = 01 

X 2 = 0101010 

X 3 = 0101010010101001 

X 4 = 010101001010100101010100101010010101010. 

We know X 4 is a prefix of c a , so we can deduce the first |X 4 | = 39 characters of c a . Thus, 

c a = 010101001010100101010100101010010101010- ■ ■ 

By Proposition |Vj c a is equal to </?i(c a [i.. 00 ]). 

c a = 0101010 0101010 010101010 0101010 010101010 - ■ ■ 
= (0001000100001000100001 • • •)■ 

3.2 Ostrowski representation 

For each directive sequence a, there is a corresponding characteristic Sturmian word c a . For 
each characteristic Sturmian word there is a numeration system, the Ostrowski representation, 
which is closely related to the standard sequence. For example, if the directive sequence is a = 
1,1,1,... then c Q is F, the Fibonacci word. The Ostrowski representation for a = 1, 1, 1, ... is 
the Zeckendorf representation, where we write an integer as a sum of Fibonacci numbers. See 
chapter three in [O for a description of these numeration systems, but note that their definition of 
Ostrowski representation differs from our definition. 

Definition 8. Let a be a directive sequence, and let {X;}°^ be the corresponding standard se- 
quence. Define an integer sequence {gi}^ where % = \Xi\ for all i > 0. Let n > be an integer. 
An a-Ostrowski representation (or simply Ostrowski representation when a is understood) for n 
is a sequence of non-negative integers {a?j}°^ such that 

1. Only finitely many d{ are nonzero. 

2. n = J2i diqi 

3. < d t < a[i] foralH > 0. 
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4. If di = a[i] then dj_i = for all i > 1. 

Note that by Proposition [61 we can also generate {qi}°Z directly from a using the following 
recurrence 

{1, ifn = 0; 

a[0] + 1, ifn=l; 
- 1] + g n -2, if n > 2. 

It is well-known that for any given directive sequence, there is a unique Ostrowski representation, 
which we denote OR a (n), for every non-negative integer 0. Also note that formally OR a (n) 
is an infinite sequence {di}^ , but we often write the terms up to the last nonzero term, e.g., 
dkdk-i ■ ■ ■ dido, with the understanding that di = for i > k. This is analogous to decimal 
representation of integers, where we write the least significant digit last and omit leading zeros. 

Theorem 9. Let a be a directive sequence. Let n > be an integer, and let d^d^-i ■ ■ ■ did be an 
Ostrowski representation for n. Then 

is a proper prefix ofX k+1 , and therefore w is a prefix ofc a . Since \w\ = ^ (4 = n, it follows 
that w = c a [0..n — 1]. 

Proof. This is essentially Theorem 9.1.13 in 0. □ 

The following technical lemma relates Ostrowski representations for a and a[l..oo], in much the 
same way that Proposition|7]relates c a to c a \i MMOO ]. 

Lemma 10. Let a be a directive sequence and define (5 := a[l..oo]. Let n > be an integer 
with Ostrowski representation OR a (n) = dk • • • d^. Then there exists an integer m > such that 
ORp(m) = dk ■ ■ ■ di and 

c a [0..n-l} = V 9 a[0] (c /3 [0..m-l])0 dt '. 
Furthermore, ifdo>0 then c p [m] = 0. 

Proof. We leave it to the reader to show that if dk • ■ ■ d is an a-Ostrowski representation then 
dk • • • d\ is a /^-Ostrowski representation, and conversely, if dk • • • d± is a /3-Ostrowski representa- 
tion then dk • • • d\0 is an a-Ostrowski representation. Theorem [9] proves that 

c a [0..n - 1] = X d k k xt'i ■ ■ ■ X^Xj = c p [0..m - l]0 do . 

Finally, suppose that do > and cp[m] = 1 for a contradiction. We consider the integer n — do + 1 
and its Ostrowski representations. On the one hand, dk • • ■ dxl is a valid Ostrowski representation 
and d Q — 1 less than n. On the other hand, 

c a [0..n - d ] = ip a [ O ]{cp[0..m - 1])0 = ip a [ O ]{cp[0..m]), 
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so OR^(m + 1) followed by is another Ostrowski representation for n — d + 1. This contradicts 
the uniqueness of Ostrowski representations. □ 



Let us continue our earlier example, where we had a directive sequence a beginning 1,3,2,2. We 
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In Table[2l we show Ostrowski representations for some small integers. By Theorem[9l we should 
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Table 2: Ostrowski representations where a = 1, 3, 2, 2, • • • 

be able to decompose c a [0..20] as X 3 XfX since OR a (21) = 1021. 

c Q [0..20] = 010101001010100101010 
= (0101010010101001)(01) 2 



4 Local periods in characteristic Sturmian words 

Let a be a directive sequence. Let p a (n) := p Ca ( n ) an d r a (n) := r Ca (n) be notation for the local 
period and shortest repetition word for characteristic Sturmian words. In this section we discuss 
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how p a {n) and r a (n) are related to OR a (n + 1). 

Definition 11. Let x, y be words in £*. Then a; is a conjugate of y if there exist words u, v E S* 
such that x = uv and y = vu. 

Lemma 12. Let a be a directive sequence, let j3 := a [L.oo] and k := a[0]. Suppose we have 
integers m, n > such that c a [0 . .n] = </?fc(cg[0..m]). Then 

(i) If u is a repetition word in Cg at position m then there exists a repetition word v in c a at 
position n such that (fk{u) is a conjugate of v. 

(ii) If v is a repetition word in c a at position n then there exists a repetition word u in cp at 
position m such that fk{u) is a conjugate of v. 

In particular, r a {n) is a conjugate of ^p k {r ^{m)) when c Q [0..n] = i^(cg[0..m]). 

Proof. We divide into two cases based on whether Cg [m] is or 1 . The situation when Cg [m] = is 
shown in Figure|2l and Cg[m] = 1 is shown in Figure|3j These figures, along with the more detailed 
diagrams in Figures @] and |5] later in the proof, indicate how tpk maps blocks in to blocks in c a . 



Cp{0..m} 



1 



fc l 



c a [0..n] 



fe l 



c a [0..n} 



Figure 2: Simple diagram for Cp[m) = 



Figure 3: Simple diagram for Cp[m) = 1 



Case Cp[m\ = 0: 

Clearly c a [0..n] ends with fc l = ^(0) since Cp[m] = 0. This gives us Figured 

(i) Let u be a repetition word in Cp at position m. If c^CLm — 1] is a suffix of u then 
certainly c a [0..n — 1] = (p k (cp[0..m — 1]) is a suffix of (fk(u). 

Suppose that u is a suffix of c^fCm — 1]. Since Cg[m] = we know u begins with 
and write u = 0u'. Since u' is a prefix of c^fm + L.oo], we see that v' := ipk(u') is a 
prefix of c a [n + L.oo]. The prefix u' in Cg[m + L.oo] is followed by 00, 01 or 10. Since 
V?fc(00), V2fc(01) and v?fc(10) all start with at least k zeros, we deduce that v' (as it occurs 
at the beginning of c a [n + L.oo]) is followed by k zeros. Thus, v := lt>'0 fc is a prefix 
of c Q [n..oo]. From the other occurrence of u (as a suffix of Cg[0..m — 1]) we deduce 
that lv'Q k is also a suffix of c Q [0..n — 1]. We conclude that v is a repetition word in 
c Q at position, and note that v = lv'0 k is a conjugate of Q k lv' = (fk(0u') = <pk(u), as 
required. 
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Figure 4: Detailed diagram for Cp[m] = 

(ii) Let v be a repetition word in c a at position n. The 1 at position n is preceded by k 
zeros. Hence, c a [0..n — 1] ends in k , so v ends in fc . Clearly v begins with 1, let v' be 
such that v = lv'O h . We do not know whether the trailing k is the beginning of ^fe(O) 
or cp k (10), but in either case v' is ipk{u') for it' a factor of c^. 

If c a [0..n — 1] is a proper suffix of v then c a [0..n — k — 1] is a suffix of ?/. Then 
c^fO-.m — 1] is a suffix of u', and hence it := Ou' is a repetition word in c@ at position 
m such that f is a conjugate of (fk{u). 

Otherwise, v is a suffix of c Q [0..n — 1]. The trailing fe in this occurrence of v is in the 
image of Cp[m] = 0. The remaining lv' must be preceded by fc , and then k lv' is the 
image of Ou', which occurs as a suffix of Cp[0..m — 1]. Now we have the situation in 
Figure|U It follows that u := Ou' is a repetition word, and v = lv'0 k is a conjugate of 

Mu) = k lv'. 

Case Cp[m] = 1: 

The characteristic Sturmian words we consider start with 0, so m ^ 0. Since Cp does not 
contain the factor 11, we know Cp[m — 1] = 0. Therefore c a [0..n] ends in </? fc (01) = fc 10, 
as shown in Figure [3l 
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\ I N 

^ \ ^ \ 
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\ \ s s v 





v' 


fc l v' k l 



V V 



Figure 5: Detailed diagram for Cp [m] = 1 

(i) Suppose u is a repetition word in Cp at position m, and let v := (pk(u). We know that 
<fk(cp[0..m — 1]) = c a [0..n — 1] and (pk(cp[m..oo]) = c a [n..oo]. Thus, 

• v is a prefix of c Q [n..oo] if u is a prefix of c^fm.-oo] 

• v is a suffix of c a [0..n — 1] if u is a suffix of cp[0..m — 1] 

• c a [0..n — 1] is a suffix of v if Cp[0..m — 1] is a suffix of u. 
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It follows that v is a repetition word in c a at position n. 

(ii) Suppose v is a repetition word in c a at position n. We know v starts with since 
c a [n] = 0, and v ends with 1 since c a [n — 1] = 1, therefore w = 0i/0 fc l for some u'. 
Then t>' = ipk(u') for some u', and we define u := ltt'O so that 

<p k ( u ) = ^(lw'O) = 0t/0 fc l = v. 

It is also clear that 

• u is a prefix of Cg[m..oo] 

• m is a suffix of C/jfO-.m — 1] if v is a suffix of c a [0..n — 1] 

• c^fO-.m — 1] is a suffix of k if c a [0..n — 1] is a suffix of 

so we conclude that u is a repetition word in Cp at position m. 

□ 

Theorem 13. Let a be a directive sequence and let (3 := a[l..oo]. Let n > be a nonnegative 
integer. Let t be the number of trailing zeros in OR a (n + 1). Then r a (n) is a conjugate of X t , 
except when all of the following conditions are met: 

• The last nonzero digit in OR a (n + 1) is 1. 

• OR a (n + 1) contains at least two nonzero digits. 

• The last two nonzero digits ofOR a (n + 1) are separated by an even number of zeros. 

When OR a (n + 1) meets these conditions, then r a (n) is a conjugate ofX t+ % . 

Proof. Let dk • • • do = OR a (n + 1) be the Ostrowski representation of n + 1. Let t be the number 
of trailing zeros in OR a (n + 1). We use induction on t to prove that r a (n) is a conjugate of X t , or 
under the conditions described above, a conjugate of X t +i. 

Base case t = 0: Since n + 1 > 0, we have d > 0. By Theorem HI we have 

c a [0..n]=X d k k ---X^. 

If do > 2 then we are done since c a [0..n] ends in 00. Hence c a \n — 1] = c a [n] = and 
r a (n) = = Xq is the shortest repetition word at position n. Let us assume without loss of 
generality that d = 1. 

According to the induction hypothesis, the second last nonzero digit in OR Q (n + 1) becomes 
relevant when the last nonzero digit is 1 . If d is the only nonzero digit, then n = and 
r a (0) is clearly c a [0] = 0. Otherwise, pick i > minimal such that d e ^ 0. That is, let di be 
the second last nonzero digit. Note that by Theorem[9l the word c a [0..n — 1] ends in X e . 
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If £ is even then X e ends in (by a simple induction), so c a [n — 1] =0 and it follows that 
r a( n ) — 0. When £ is odd, the word X e ends in X x and Xi ends in 1. It follows that 

c a [0..n - 1] = y a [o](C/3[0..m - 1]) 

for some m > 0. We claim that Cp[m] = 0, since otherwise 

(Pa[o](c (3 [0..m}) = c a [0..n] 

so LemmaflOl states that OR a (n + 1) ends in 0, contradicting d = 1. Then c Q ,[n..oo] begins 
with (pa[o](Ca[m)) = X x , so r a (n) = X x . 

Inductive step t > 0: We note that removing (or adding) trailing zeros from OR a (n + 1) does 
not change whether it satisfies all three conditions in the theorem. We will assume that 
OR a (n + 1) does not meet the conditions, since the proof is nearly identical if it does meet 
the conditions. 

Let {Xt}^ and {Yi}°Z be standard sequences corresponding to the directive sequences a 
and respectively. Lemma [lOl states that c a [0..n] = y a [o](c^[0..m]) where m > is such 
that 

OR^m + l) = d k ---d 1 . 

Note that • ■ • d\ has t — 1 trailing zeros, so rp(m) is a conjugate of Yt-\ by induction. By 
Lemma [T2l r a {n) is a conjugate of <^ a [o](^t-i) = Xt> completing the proof. 

□ 

Let us continue our example with a directive sequence a starting with 1,3,2,2. Recall that 

c a = 01010 10010 10100 10101 01001 01010 01010 1010 ■ ■ ■ 

Consider the shortest repetition words at positions 23 through 26. These positions happen to give 
illustrative examples of the theorem. 



r Q (23) 


= 


X 


= 


OR Q (24) = 


1101 


r a (2A) 


= 1010100 


x 2 


= 0101010 


OR Q (25) = 


1110 


r Q (25) 


= 01 


x x 


= 01 


OR Q (26) = 


1111 


r Q (26) 


= 10 


x x 


= 01 


OR Q (27) = 


1120 



When n = 23, there are no trailing zeros in OR a (24) = 1101 and we have an odd number of zeros 
between the last two nonzero digits. Hence, r Q (23) is a conjugate of X = 0. Compare this to 
n = 25, where OR Q (26) = 1111 also has no trailing zeros, but the last two ones are adjacent, so 
r a (25) is a conjugate of X-y. We are in a similar situation for n = 24, but with an trailing zero 
so r Q (24) is a conjugate of X 2 . Finally, consider n = 26 where the last two nonzero digits are 
adjacent and we have a trailing zero, like n = 24, but the last nonzero digit is not a one. It follows 
that r a (26) is a conjugate of X\. Although r Q (25) and r Q (26) are both conjugates of X\, they are 
not the same. 
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5 Open Problems and Further Work 



It would be interesting to generalize the result to two-sided Sturmian words, with an appropriate 
definition for local period in two-sided words. We might define a repetition word in iw E W S W 
at position n as a word that is simultaneously a prefix of u>[n..oo] and a suffix of w[—oo..n — 1]. 
Note that if we extend a characteristic Sturmian word c Q to a two-sided word w, the local period at 
position n in w may not be the same as the local period at position n in c a . 

Our main result is about the local period and the shortest repetition word, but Lemma [[2] applies to 
all repetition words at a specific position. Is it possible to extend our result to all repetition words, 
not just the shortest repetition word? Patterns in the lengths of repetition words for the Fibonacci 
word suggest that it is possible, but we do not have a specific conjecture. 
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